Search | VHL Regional Portal

Global prediction of extreme floods in ungauged watersheds.

Nearing, Grey; Cohen, Deborah; Dube, Vusumuzi; Gauch, Martin; Gilon, Oren; Harrigan, Shaun; Hassidim, Avinatan; Klotz, Daniel; Kratzert, Frederik; Metzger, Asher; Nevo, Sella; Pappenberger, Florian; Prudhomme, Christel; Shalev, Guy; Shenzis, Shlomo; Tekalign, Tadele Yednkachw; Weitzner, Dana; Matias, Yossi.

Nature ; 627(8004): 559-563, 2024 Mar.

Article in English | MEDLINE | ID: mdl-38509278

ABSTRACT

Floods are one of the most common natural disasters, with a disproportionate impact in developing countries that often lack dense streamflow gauge networks [1]. Accurate and timely warnings are critical for mitigating flood risks [2], but hydrological simulation models typically must be calibrated to long data records in each watershed. Here we show that artificial intelligence-based forecasting achieves reliability in predicting extreme riverine events in ungauged watersheds at up to a five-day lead time that is similar to or better than the reliability of nowcasts (zero-day lead time) from a current state-of-the-art global modelling system (the Copernicus Emergency Management Service Global Flood Awareness System). In addition, we achieve accuracies over five-year return period events that are similar to or better than current accuracies over one-year return period events. This means that artificial intelligence can provide flood warnings earlier and over larger and more impactful events in ungauged basins. The model developed here was incorporated into an operational early warning system that produces publicly available (free and open) forecasts in real time in over 80 countries. This work highlights a need for increasing the availability of hydrological data to continue to improve global access to reliable flood warnings.

Subjects

Artificial Intelligence, Computer Simulation, Floods, Forecasting, Forecasting/methods, Reproducibility of Results, Rivers, Hydrology, Calibration, Time Factors, Disaster Planning/methods

Caravan - A global community dataset for large-sample hydrology.

Kratzert, Frederik; Nearing, Grey; Addor, Nans; Erickson, Tyler; Gauch, Martin; Gilon, Oren; Gudmundsson, Lukas; Hassidim, Avinatan; Klotz, Daniel; Nevo, Sella; Shalev, Guy; Matias, Yossi.

Sci Data ; 10(1): 61, 2023 01 31.

Article in English | MEDLINE | ID: mdl-36717577

ABSTRACT

High-quality datasets are essential to support hydrological science and modeling. Several CAMELS (Catchment Attributes and Meteorology for Large-sample Studies) datasets exist for specific countries or regions; however, these datasets lack standardization, which makes global studies difficult. This paper introduces a dataset called Caravan (a series of CAMELS) that standardizes and aggregates seven existing large-sample hydrology datasets. Caravan includes meteorological forcing data, streamflow data, and static catchment attributes (e.g., geophysical, sociological, climatological) for 6830 catchments. Most importantly, Caravan is both a dataset and open-source software that allows members of the hydrology community to extend the dataset to new locations by extracting forcing data and catchment attributes in the cloud. Our vision is for Caravan to democratize the creation and use of globally-standardized large-sample hydrology datasets. Caravan is a truly global open-source community resource.

In Defense of Metrics: Metrics Sufficiently Encode Typical Human Preferences Regarding Hydrological Model Performance.

Gauch, Martin; Kratzert, Frederik; Gilon, Oren; Gupta, Hoshin; Mai, Juliane; Nearing, Grey; Tolson, Bryan; Hochreiter, Sepp; Klotz, Daniel.

Water Resour Res ; 59(6): e2022WR033918, 2023 Jun.

Article in English | MEDLINE | ID: mdl-38440056

ABSTRACT

Building accurate rainfall-runoff models is an integral part of hydrological science and practice. The variety of modeling goals and applications has led to a large suite of evaluation metrics for these models. Yet, hydrologists still place considerable trust in visual judgment, although it is unclear whether such judgment agrees or disagrees with existing quantitative metrics. In this study, we tasked 622 experts to compare and judge more than 14,000 pairs of hydrographs from 13 different models. Our results show that expert opinion broadly agrees with quantitative metrics and results in a clear preference for a Machine Learning model over traditional hydrological models. The expert opinions are, however, subject to significant amounts of inconsistency. Nevertheless, where experts agree, we can predict their opinion purely from quantitative metrics, which indicates that the metrics sufficiently encode human preferences in a small set of numbers. While there remains room for improvement of quantitative metrics, we suggest that the hydrologic community should reinforce their benchmarking efforts and put more trust in these metrics.
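
For readers unfamiliar with what a quantitative hydrograph metric looks like in practice, the following is a minimal sketch of one widely used example, the Nash-Sutcliffe efficiency (NSE). The abstract above does not say which metrics the study used, so the choice of NSE, the function name, and the sample values are all assumptions made for illustration.

```python
from typing import Sequence


def nash_sutcliffe_efficiency(
    observed: Sequence[float], simulated: Sequence[float]
) -> float:
    """Return the NSE of a simulated hydrograph against observations.

    NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2).
    A value of 1 is a perfect match; 0 means the simulation is no better
    than always predicting the observed mean.
    """
    if len(observed) != len(simulated) or len(observed) == 0:
        raise ValueError("observed and simulated must be non-empty and equal in length")
    mean_obs = sum(observed) / len(observed)
    squared_error = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    obs_variance = sum((o - mean_obs) ** 2 for o in observed)
    if obs_variance == 0:
        raise ValueError("NSE is undefined when all observations are identical")
    return 1.0 - squared_error / obs_variance


# Illustrative (made-up) streamflow values for two hypothetical models.
observed_flow = [1.0, 2.0, 3.0, 4.0]
model_a_flow = [1.1, 1.9, 3.2, 3.8]
model_b_flow = [2.0, 2.0, 2.0, 2.0]

print(nash_sutcliffe_efficiency(observed_flow, model_a_flow))
print(nash_sutcliffe_efficiency(observed_flow, model_b_flow))
```

A single score like this condenses a pair of hydrographs into one number, which is the kind of compact encoding the abstract compares against expert visual judgment.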

Customization scenarios for de-identification of clinical notes.

Hartman, Tzvika; Howell, Michael D; Dean, Jeff; Hoory, Shlomo; Slyper, Ronit; Laish, Itay; Gilon, Oren; Vainstein, Danny; Corrado, Greg; Chou, Katherine; Po, Ming Jack; Williams, Jutta; Ellis, Scott; Bee, Gavin; Hassidim, Avinatan; Amira, Rony; Beryozkin, Genady; Szpektor, Idan; Matias, Yossi.

BMC Med Inform Decis Mak ; 20(1): 14, 2020 01 30.

Article in English | MEDLINE | ID: mdl-32000770

ABSTRACT

BACKGROUND: Automated machine-learning systems are able to de-identify electronic medical records, including free-text clinical notes. Use of such systems would greatly boost the amount of data available to researchers, yet their deployment has been limited due to uncertainty about their performance when applied to new datasets. OBJECTIVE: We present practical options for clinical note de-identification, assessing performance of machine learning systems ranging from off-the-shelf to fully customized. METHODS: We implement a state-of-the-art machine learning de-identification system, training and testing on pairs of datasets that match the deployment scenarios. We use clinical notes from two i2b2 competition corpora, the Physionet Gold Standard corpus, and parts of the MIMIC-III dataset. RESULTS: Fully customized systems remove 97-99% of personally identifying information. Performance of off-the-shelf systems varies by dataset, with performance mostly above 90%. Providing a small labeled dataset or large unlabeled dataset allows for fine-tuning that improves performance over off-the-shelf systems. CONCLUSION: Health organizations should be aware of the levels of customization available when selecting a de-identification deployment solution, in order to choose the one that best matches their resources and target performance level.

Subjects

Data Anonymization/standards, Electronic Health Records, Machine Learning/standards, Datasets as Topic, Humans
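
The removal percentages quoted in the de-identification abstract above can be read as a recall-style measure: of everything annotated as personally identifying, what share did the system remove? The sketch below illustrates that reading at the token level. It is an assumption for illustration only; the abstract does not specify how the figures were computed, and the function name and token indices are hypothetical.

```python
from typing import AbstractSet


def pii_removal_rate(
    gold_pii_tokens: AbstractSet[int], removed_tokens: AbstractSet[int]
) -> float:
    """Fraction of annotated PII token positions that the system removed.

    gold_pii_tokens: token indices a human annotator marked as PII.
    removed_tokens:  token indices the de-identification system removed.
    """
    if not gold_pii_tokens:
        raise ValueError("no annotated PII tokens to evaluate against")
    correctly_removed = gold_pii_tokens & removed_tokens
    return len(correctly_removed) / len(gold_pii_tokens)


# Hypothetical note: tokens 1-4 are PII; the system removed 2, 3, 4 and
# (needlessly) token 9, which does not count toward the removal rate.
print(pii_removal_rate({1, 2, 3, 4}, {2, 3, 4, 9}))  # 0.75
```

Note that this measure ignores over-removal (token 9 in the example), so a real evaluation would normally report precision alongside it.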
